SEQUENCE LISTING

1 (1) GENERAL INFORMATION: 

2 (i) APPLICANT: Malcolm Whitman and Xin Chen 

3 (ii) TITLE OF INVENTION: METHODS AND REAGENTS FOR MODULATING 

4 TGF-BETA SUPERFAMILY SIGNALLING 

5 (iii) NUMBER OF SEQUENCES: 18 

6 (iv) CORRESPONDENCE ADDRESS: 

7 (A) ADDRESSEE: Clark & Elbing LLP 

8 (B) STREET: 176 Federal Street 

9 (C) CITY: Boston 

10 (D) STATE: MA 

11 (E) COUNTRY: USA 

12 (F) ZIP: 02110 

13 (v) COMPUTER READABLE FORM: 

14 (A) MEDIUM TYPE: Diskette 

15 (B) COMPUTER: IBM Compatible 

16 (C) OPERATING SYSTEM: DOS 

17 (D) SOFTWARE: FastSEQ for Windows Version 2.0 

18 (vi) CURRENT APPLICATION DATA: 

C--> 19 (A) APPLICATION NUMBER: US/10/044,442

C--> 20 (B) FILING DATE: 11-Jan-2002

21 (C) CLASSIFICATION: 

22 (vii) PRIOR APPLICATION DATA: 

W--> 23 (A) APPLICATION NUMBER: US/09/087,134

24 (B) FILING DATE: 27-MAY-1998 

W--> 25 (A) APPLICATION NUMBER: 60/047,991 

26 (B) FILING DATE: 28-MAY-1997 

27 (viii) ATTORNEY/AGENT INFORMATION: 

28 (A) NAME: Bieker-Brady, Kristina

29 (B) REGISTRATION NUMBER: 39,109

30 (C) REFERENCE/DOCKET NUMBER: 00246/501002

31 (ix) TELECOMMUNICATION INFORMATION: 

32 (A) TELEPHONE: 617-428-0200 

33 (B) TELEFAX: 617-428-7045 

34 (C) TELEX: 

35 (2) INFORMATION FOR SEQ ID NO: 1: 

36 (i) SEQUENCE CHARACTERISTICS: 

37 (A) LENGTH: 1658 base pairs 

38 (B) TYPE: nucleic acid 

39 (C) STRANDEDNESS: single

40 (D) TOPOLOGY: linear

41 (ii) MOLECULE TYPE: cDNA 

42 (ix) FEATURE: 

43 (A) NAME/KEY: Other

44 (B) LOCATION: 1...1

45 (D) OTHER INFORMATION: Xenopus Smad2 coding region 

46 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

47 ATGTCGTCCA TCTTGCCTTT CACCCCGCCA GTAGTGAAGC GCCTGCTAGG ATGGAAGAAG 60

48 TCTGCAAGTG GCACCACAGG AGCAGGTGGC GATGAGCAGA ACGGACAGGA AGAGAAGTGG 120

49 TGCGAAAAAG CGGTAAAGAG CTTGGTGAAA AAACTGAAGA AAACGGGACA ATTAGACGAG 180

50 CTTGAGAAGG CGATCACGAC GCAGAACTGC AACACGAAAT GCGTAACGAT ACCAAGCACT 240 

51 TGCTCTGAAA TTTGGGGACT GAGTACAGCA AATACCATAG ATCAGTGGGA TACCACAGGC 300

52 CTTTACAGCT TCTCTGAACA AACCAGGTCT CTTGATGGTC GACTCCAGGT GTCTCACCGT 360 

53 AAAGGATTGC CGCATGTTAT CTACTGCAGA CTGTGGCGCT GGCCAGACCT GCACAGTCAT 420

54 CATGAACTGA AAGCAATCGA AAATTGTGAA TATGCTTTTA ACCTTAAAAA AGATGAAGTT 480 

55 TGTGTCAATC CATACCATTA TCAGAGGGTG GAGACACCAG TTTTACCACC TGTATTAGTT 540 

56 CCACGGCACA CGGAAATCTT GACAGAGCTG CCACCTCTTG ATGACTACAC GCATTCCATT 600 

57 CCAGAAAACA CTAATTTTCC TGCAGGGATT GAACCTCAGA GCAATTATAT TCCAGAAACA 660

58 CCACCTCCTG GATATATTAG TGAAGATGGA GAAACTAGCG ATCAGCAACT TAACCAAAGC 720

59 ATGGACACAG GGTCACCAGC TGAGCTGTCT CCGAGTACAC TTTCTCCAGT CAACCACAAT 780 

60 CTCGATTTGC AACCTGTCAC CTATTCGGAA CCTGCTTTTT GGTGCTCTAT AGCATACTAC 840 

61 GAACTGAATC AGCGAGTAGG AGAAACTTTC CATGCATCGC AACCATCGCT TACCGTGGAC 900

62 GGCTTTACGG ACCCCTCAAA CTCTGAAAGG TTCTGCTTAG GTTTACTCTC AAATGTGAAC 960 

63 CGAAATGCCA CGGTGGAAAT GACCAGGCGT CACATAGGAA GGGGTGTCCG GCTATATTAC 1020

64 ATCGGTGGAG AGGTGTTTGC AGAGTGCCTA AGTGATAGTG CTATTTTTGT TCAGAGTCCA 1080 

65 AACTGTAACC AGCGATATGG ATGGCATCCA GCAACTGTAT GTAAGATTCC TCCAGGATGC 1140 

66 AATCTGAAGA TTTTCAATAA TCAAGAGTTT GCGGCTCTCC TCGCTCAGTC TGTGAATCAA 1200 

67 GGCTTTGAAG CAGTTTATCA GTTAACTCGA ATGTGCACCA TAAGGATGAG CTTTGTAAAA 1260 

68 GGCTGGGGTG CTGAATACAG GCGACAGACC GTTACAAGCA CTCCATGCTG GATTGAGCTT 1320 

69 CACCTGAATG GACCTTTGCA GTGGTTGGAC AAAGTGTTGA CACAGATGGG ATCCCCTTCA 1380 

70 GTCCGCTGCT CAAGCATGTC CTAATGGTCT CCTCTTTTTA ATGTATTACC TGCGGGCGGC 1440 

71 AACTGCAGTC CCAGCAACAG ACTCAATACA GCTTGTCTGT CGTAGTATTT GTGTGTGGTG 1500 

72 CCCATGAACT GTTTACAATC CAAAAGAGAG AGAATAAAAA AGCAAAAACA GCACTTGAGA 1560 

73 TCCCATCAAC GAAAAGCACC TTGTTGGATG ATGTTTCTGA TACTCTTAAA GTAGATCCGT 1620

74 GTATAAATGA CTCCTTACCT GGGAAAAGGG ACTTTTTC 1658 

76 (2) INFORMATION FOR SEQ ID NO: 2: 

77 (i) SEQUENCE CHARACTERISTICS: 

78 (A) LENGTH: 467 amino acids 

79 (B) TYPE: amino acid 

80 (C) STRANDEDNESS: single

81 (D) TOPOLOGY: linear 

82 (ii) MOLECULE TYPE: protein 

83 (ix) FEATURE: 

84 (A) NAME/KEY: Other

85 (B) LOCATION: 1...1 

86 (D) OTHER INFORMATION: Xenopus Smad2 

87 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

88 Met Ser Ser Ile Leu Pro Phe Thr Pro Pro Val Val Lys Arg Leu Leu

89 1 5 10 15 

90 Gly Trp Lys Lys Ser Ala Ser Gly Thr Thr Gly Ala Gly Gly Asp Glu 

91 20 25 30 

92 Gln Asn Gly Gln Glu Glu Lys Trp Cys Glu Lys Ala Val Lys Ser Leu

94 Val Lys Lys Leu Lys Lys Thr Gly Gln Leu Asp Glu Leu Glu Lys Ala

95 50 55 60 

96 Ile Thr Thr Gln Asn Cys Asn Thr Lys Cys Val Thr Ile Pro Ser Thr

97 65 70 75 80 

98 Cys Ser Glu Ile Trp Gly Leu Ser Thr Ala Asn Thr Ile Asp Gln Trp

142 Ser Thr Pro Cys Trp Ile Glu Leu His Leu Asn Gly Pro Leu Gln Trp

143 435 440 445 

144 Leu Asp Lys Val Leu Thr Gln Met Gly Ser Pro Ser Val Arg Cys Ser

145 450 455 460 

146 Ser Met Ser 

147 465 

149 (2) INFORMATION FOR SEQ ID NO: 3: 

150 (i) SEQUENCE CHARACTERISTICS: 

151 (A) LENGTH: 194 amino acids 

152 (B) TYPE: amino acid 

153 (C) STRANDEDNESS: single

154 (D) TOPOLOGY: linear 

155 (ii) MOLECULE TYPE: protein 

156 (ix) FEATURE: 

157 (A) NAME/KEY: Other

158 (B) LOCATION: 1...1 

159 (D) OTHER INFORMATION: Xenopus Smad2 MH2 domain 

160 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

161 Trp Cys Ser Ile Ala Tyr Tyr Glu Leu Asn Gln Arg Val Gly Glu Thr

162 1 5 10 15

163 Phe His Ala Ser Gln Pro Ser Leu Thr Val Asp Gly Phe Thr Asp Pro

164 20 25 30 

165 Ser Asn Ser Glu Arg Phe Cys Leu Gly Leu Leu Ser Asn Val Asn Arg 

166 35 40 45 

167 Asn Ala Thr Val Glu Met Thr Arg Arg His Ile Gly Arg Gly Val Arg

168 50 55 60

169 Leu Tyr Tyr Ile Gly Gly Glu Val Phe Ala Glu Cys Leu Ser Asp Ser

170 65 70 75 80

171 Ala Ile Phe Val Gln Ser Pro Asn Cys Asn Gln Arg Tyr Gly Trp His

172 85 90 95

173 Pro Ala Thr Val Cys Lys Ile Pro Pro Gly Cys Asn Leu Lys Ile Phe

174 100 105 110 

175 Asn Asn Gln Glu Phe Ala Ala Leu Leu Ala Gln Ser Val Asn Gln Gly

176 115 120 125

177 Phe Glu Ala Val Tyr Gln Leu Thr Arg Met Cys Thr Ile Arg Met Ser

178 130 135 140

179 Phe Val Lys Gly Trp Gly Ala Glu Tyr Arg Arg Gln Thr Val Thr Ser

180 145 150 155 160 

181 Thr Pro Cys Trp Ile Glu Leu His Leu Asn Gly Pro Leu Gln Trp Leu

182 165 170 175

183 Asp Lys Val Leu Thr Gln Met Gly Ser Pro Ser Val Arg Cys Ser Ser

184 180 185 190 

185 Met Ser 

187 (2) INFORMATION FOR SEQ ID NO: 4: 

188 (i) SEQUENCE CHARACTERISTICS: 

189 (A) LENGTH: 1401 base pairs 

190 (B) TYPE: nucleic acid 

191 (C) STRANDEDNESS: single 

192 (D) TOPOLOGY: linear 

193 (ii) MOLECULE TYPE: cDNA

194 (ix) FEATURE: 

195 (A) NAME/KEY: Other

196 (B) LOCATION: 

197 (D) OTHER INFORMATION: Human Smad2 coding region 

198 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

199 ATGTCGTCCA TCTTGCCATT CACGCCGCCA GTTGTGAAGA GACTGCTGGG ATGGAAGAAG 60

200 TCAGCTGGTG GGTCTGGAGG AGCAGGCGGA GGAGAGCAGA ATGGGCAGGA AGAAAAGTGG 120

201 TGTGAGAAAG CAGTGAAAAG TCTGGTGAAG AAGCTAAAGA AAACAGGACG ATTAGATGAG 180 

202 CTTGAGAAAG CCATCACCAC TCAAAACTGT AATACTAAAT GTGTTACCAT ACCAAGCACT 240

203 TGCTCTGAAA TTTGGGGACT GAGTACACCA AATACGATAG ATCAGTGGGA TACAACAGGC 300 

204 CTTTACAGCT TCTCTGAACA AACCAGGTCT CTTGATGGTC GTCTCCAGGT ATCCCATCGA 360 

205 AAAGGATTGC CACATGTTAT ATATTGCCGA TTATGGCGCT GGCCTGATCT TCACAGTCAT 420

206 CATGAACTCA AGGCAATTGA AAACTGCGAA TATGCTTTTA ATCTTAAAAA GGATGAAGTA 480 

207 TGTGTAAACC CTTACCACTA TCAGAGAGTT GAGACACCAG TTTTGCCTCC AGTATTAGTG 540

208 CCCCGACACA CCGAGATCCT AACAGAACTT CCGCCTCTGG ATGACTATAC TCACTCCATT 600

209 CCAGAAAACA CTAACTTCCC AGCAGGAATT GAGCCACAGA GTAATTATAT TCCAGAAACG 660 

210 CCACCTCCTG GATATATCAG TGAAGATGGA GAAACAAGTG ACCAACAGTT GAATCAAAGT 720 

211 ATGGACACAG GCTCTCCAGC AGAACTATCT CCTACTACTC TTTCCCCTGT TAATCATAGC 780 

212 TTGGATTTAC AGCCAGTTAC TTACTCAGAA CCTGCATTTT GGTGTTCAAT AGCATATTAT 840

213 GAATTAAATC AGAGGGTTGG AGAAACCTTC CATGCATCAC AGCCCTCACT CACTGTAGAT 900 

214 GGCTTTACAG ACCCATCAAA TTCAGAGAGG TTCTGCTTAG GTTTACTCTC CAATGTTAAC 960 

215 CGAAATGCCA CGGTAGAAAT GACAAGAAGG CATATAGGAA GAGGAGTGCG CTTATACTAC 1020

216 ATAGGTGGGG AAGTTTTTGC TGAGTGCCTA AGTGATAGTG CAATCTTTGT GCAGAGCCCC 1080 

217 AATTGTAATC AGAGATATGG CTGGCACCCT GCAACAGTGT GTAAAATTCC ACCAGGCTGT 1140 

218 AATCTGAAGA TCTTCAACAA CCAGGAATTT GCTGCTCTTC TGGCTCAGTC TGTTAATCAG 1200 

219 GGTTTTGAAG CCGTCTATCA GCTAACTAGA ATGTGCACCA TAAGAATGAG TTTTGTGAAA 1260

220 GGGTGGGGAG CAGAATACCG AAGGCAGACG GTAACAAGTA CTCCTTGCTG GATTGAACTT 1320

221 CATCTGAATG GACCTCTACA GTGGTTGGAC AAAGTATTAA CTCAGATGGG ATCCCCTTCA 1380 

222 GTGCGTTGCT CAAGCATGTC A 1401 

224 (2) INFORMATION FOR SEQ ID NO: 5: 

225 (i) SEQUENCE CHARACTERISTICS: 

226 (A) LENGTH: 467 amino acids

227 (B) TYPE: amino acid

228 (C) STRANDEDNESS: single

229 (D) TOPOLOGY: linear

230 (ii) MOLECULE TYPE: protein

231 (ix) FEATURE:

232 (A) NAME/KEY: Other

233 (B) LOCATION: 1...1 

234 (D) OTHER INFORMATION: Human Smad2

235 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

236 Met Ser Ser Ile Leu Pro Phe Thr Pro Pro Val Val Lys Arg Leu Leu

237 1 5 10 15

238 Gly Trp Lys Lys Ser Ala Gly Gly Ser Gly Gly Ala Gly Gly Gly Glu

239 20 25 30

240 Gln Asn Gly Gln Glu Glu Lys Trp Cys Glu Lys Ala Val Lys Ser Leu

241 35 40 45 

242 Val Lys Lys Leu Lys Lys Thr Gly Arg Leu Asp Glu Leu Glu Lys Ala 

L:19 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER:]
L:20 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:]

L:25 M:238 W: Alpha Fields not Ordered, Reordered [(A) APPLICATION NUMBER:] of (1) (vii) 